An Efficient Text Clustering Framework
نویسندگان
چکیده
منابع مشابه
Efficient streaming text clustering
Clustering data streams has been a new research topic, recently emerged from many real data mining applications, and has attracted a lot of research attention. However, there is little work on clustering high-dimensional streaming text data. This paper combines an efficient online spherical k-means (OSKM) algorithm with an existing scalable clustering strategy to achieve fast and adaptive clust...
متن کاملAn Efficient Curvelet Framework for Denoising Images
Wiener filter suppresses noise efficiently. However, it makes the out image blurred. Curvelet preserves the edges of natural images perfectly, but, it produces visual distortion artifacts and fuzzy edges to the restored image, especially in homogeneous regions of images. In this paper, a new image denoising framework based on Curvelet transform and wiener filter is proposed, which can stop nois...
متن کاملAn Efficient Approach for Text Clustering Based on Frequent Itemsets
In recent times, the vast amount of textual information available in electronic form is growing at staggering rate. This increasing number of textual data has led to the task of mining useful or interesting frequent itemsets (words/terms) from very large text databases and still it seems to be quite challenging. The use of such frequent itemsets for text clustering has received a great deal of ...
متن کاملAn Efficient Clustering Algorithm for Text Mining Using Greedy Approach
I. Introduction " Data Mining " involves the integration of concepts from computer science, mathematics, and statistics. It seeks to extract useful information and detect interesting correlation and patterns from any form of data, especially numeric data. Data Mining is most associated with the broader process of Knowledge Discovery in Databases (KDD), " the nontrivial process of identifying va...
متن کاملA Text Clustering Framework for Information Retrieval
Text-mining methods have become a key feature for homeland-security technologies, as they can help explore effectively increasing masses of digital documents in the search for relevant information. This research presents a model for document clustering that arranges unstructured documents into content-based homogeneous groups. The overall paradigm is hybrid because it combines pattern-recogniti...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Applications
سال: 2013
ISSN: 0975-8887
DOI: 10.5120/13763-1607